MiniLLM is a lightweight language model project that fully implements the entire process from pre-training → instruction fine-tuning → reward modeling → reinforcement learning, economically and efficiently building a chat model with basic conversational capabilities
Large Language Model
Transformers